NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Convex Relaxation Approach to Generalization Analysis for Parallel Positively Homogeneous Networks

Tadipatri, Uday; Haeffele, Benjamin; Agterberg, Joshua; Vidal, Rene (April 2025, AISTATS)

Free, publicly-accessible full text available April 30, 2026
Unsupervised Wave Physics-Informed Representation Learning for Guided Wavefield Reconstruction

Harley, Joel B; Haeffele, Benjamin; Tetali, Harsha Vardhan (February 2024, Report and statement of accounts Printing and Publishing Industry Training Board)
Blasch, Erik; Darema, Frederica; Aved, Alex (Ed.)
Full Text Available
Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

Chu, Tianzhe Chu; Tong, Shengbang; Ding, Tianjiao; Dai, Xili; Haeffele, Benjamin; Vidal, René; Ma, Yi (May 2024, International Conference on Learning Representations (ICLR))

The advent of large pre-trained models has brought about a paradigm shift in both visual representation learning and natural language processing. However, clustering unlabeled images, as a fundamental and classic machine learning problem, still lacks an effective solution, particularly for large-scale datasets. In this paper, we propose a novel image clustering pipeline that leverages the powerful feature representation of large pre-trained models such as CLIP and cluster images effectively and efficiently at scale. We first developed a novel algorithm to estimate the number of clusters in a given dataset. We then show that the pre-trained features are significantly more structured by further optimizing the rate reduction objective. The resulting features may significantly improve the clustering accuracy, e.g., from 57\% to 66\% on ImageNet-1k. Furthermore, by leveraging CLIP's multimodality bridge between image and text, we develop a simple yet effective self-labeling algorithm that produces meaningful captions for the clusters. Through extensive experiments, we show that our pipeline works well on standard datasets such as CIFAR-10, CIFAR-100, and ImageNet-1k. It also extends to datasets that are not curated for clustering, such as LAION-Aesthetics and WikiArts. We released the code in https://github.com/LeslieTrue/CPP
more » « less
Full Text Available
Wave Physics-Informed Matrix Factorizations

https://doi.org/10.1109/TSP.2023.3348948

Tetali, Harsha Vardhan; Harley, Joel B.; Haeffele, Benjamin D. (January 2024, IEEE Transactions on Signal Processing)

With the recent success of representation learning methods, which includes deep learning as a special case, there has been considerable interest in developing techniques that incorporate known physical constraints into the learned representation. As one example, in many applications that involve a signal propagating through physical media (e.g., optics, acoustics, fluid dynamics, etc.), it is known that the dynamics of the signal must satisfy constraints imposed by the wave equation. Here we propose a matrix factorization technique that decomposes such signals into a sum of components, where each component is regularized to ensure that it nearly satisfies wave equation constraints. Although our proposed formulation is non-convex, we prove that our model can be efficiently solved to global optimality. Through this line of work we establish theoretical connections between wave-informed learning and filtering theory in signal processing. We further demonstrate the application of this work on modal analysis problems commonly arising in structural diagnostics and prognostics.
more » « less
Full Text Available
White-Box Transformers via Sparse Rate Reduction

Yu, Yaodong; Buchanan, Sam; Pai, Druv; Chu, Tianzhe; Wu, Ziyang; Tong, Shengbang; Haeffele, Benjamin; Ma, Yi (December 2023, the Thirty-seventh Annual Conference on Neural Information Processing Systems)

In this paper, we contend that the objective of representation learning is to compress and transform the distribution of the data, say sets of tokens, towards a mixture of low-dimensional Gaussian distributions supported on incoherent subspaces. The quality of the final representation can be measured by a unified objective function called sparse rate reduction. From this perspective, popular deep networks such as transformers can be naturally viewed as realizing iterative schemes to optimize this objective incrementally. Particularly, we show that the standard transformer block can be derived from alternating optimization on complementary parts of this objective: the multi-head self-attention operator can be viewed as a gradient descent step to compress the token sets by minimizing their lossy coding rate, and the subsequent multi-layer perceptron can be viewed as attempting to sparsify the representation of the tokens. This leads to a family of white-box transformer-like deep network architectures which are mathematically fully interpretable. Despite their simplicity, experiments show that these networks indeed learn to optimize the designed objective: they compress and sparsify representations of large-scale real-world vision datasets such as ImageNet, and achieve performance very close to thoroughly engineered transformers such as ViT. Code is at https://github. com/Ma-Lab-Berkeley/CRATE.
more » « less
Full Text Available
Unsupervised Manifold Linearizing and Clustering

Ding, Tianjiao; Tong, Shengbang; Chan, Ryan; Dai, Xili; Ma, Yi; Haeffele, Benjamin (October 2023, International Conference on Computer Vision)

We consider the problem of simultaneously clustering and learning a linear representation of data lying close to a union of low-dimensional manifolds, a fundamental task in machine learning and computer vision. When the manifolds are assumed to be linear subspaces, this reduces to the classical problem of subspace clustering, which has been studied extensively over the past two decades. Unfortunately, many real-world datasets such as natural images can not be well approximated by linear subspaces. On the other hand, numerous works have attempted to learn an appropriate transformation of the data, such that data is mapped from a union of general non-linear manifolds to a union of linear subspaces (with points from the same manifold being mapped to the same subspace). However, many existing works have limitations such as assuming knowledge of the membership of samples to clusters, requiring high sampling density, or being shown theoretically to learn trivial representations. In this paper, we propose to optimize the Maximal Coding Rate Reduction metric with respect to both the data representation and a novel doubly stochastic cluster membership, inspired by state-of-the-art subspace clustering results. We give a parameterization of such a representation and membership, allowing efficient mini-batching and one-shot initialization. Experiments on CIFAR-10, -20, -100, and TinyImageNet-200 datasets show that the proposed method is much more accurate and scalable than state-of-the-art deep clustering methods, and further learns a latent linear representation of the data.
more » « less
Full Text Available
Variational Information Pursuit for Interpretable Predictions

Chattopadhyay, Aditya; Chan, Ryan; Haeffele, Benjamin; Geman, Donald; Vidal, Rene (July 2023, ICLR)

Full Text Available
Interpretable by Design: Learning Predictors by Composing Interpretable Queries

https://doi.org/10.1109/TPAMI.2022.3225162

Chattopadhyay, Aditya; Slocum, Stewart; Haeffele, Benjamin D.; Vidal, René; Geman, Donald (June 2023, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
A Critique of Self-Expressive Deep Subspace Clustering

Haeffele, Benjamin D; You, Chong; Vidal, Rene (April 2022, International Conference on Learning Representations)

Full Text Available
A novel variational form of the Schatten-p quasi-norm

Giampouras, Paris; Vidal, Rene; Rontogiannis, Athanasios; Haeffele, Benjamin D. (December 2020, Advances in neural information processing systems)

Full Text Available

Search for: All records